Audio user interface for displayless electronic device

ABSTRACT

This invention is directed to an audio menu provided in an electronic device having no display. The electronic device can further include an input interface having only a single sensing element (e.g., a single button) for controlling audio playback of the device and for accessing and controlling the device audio menu. In response to a particular input detected by the single sensing element, the electronic device can enable an audio menu mode and play back audio clips associated with different menu options. The user can provide selection instructions using the single sensing element during the playback of an audio clip to select the menu option associated with the played back audio clip. In some embodiments, the audio menu can be multi-dimensional (e.g., the device plays back audio clips for sub-options in response to a selection of a menu option). Suitable menu options can include, for example, groupings of audio (e.g., playlists), options to toggle (e.g., a shuffle option), or options associated with particular metadata tags associated with audio available to the device.

BACKGROUND OF THE INVENTION

This is directed to providing an audio user interface for an electronic device that does not have a display.

Users can interact with electronic devices using different interfaces. For example, if a device includes a display, the device processor can direct the display to display a graphical user interface. The graphical user interface can include information displayed for the user, such as application windows, text or images, or any other suitable information stored locally or retrieved from a remote source (e.g., the Internet or a host device). The graphical user interface can also include displayed selectable options that the user can select to direct the electronic device to perform operations. Such operations can include, for example, operations tied to particular applications, instructions to open or display information, instructions to close or end a process or application, or any other suitable electronic device operation. To select a displayed option, the user can provide an instruction using an input interface coupled to the device.

Not every electronic device, however, includes a display. To control different device operations, the electronic device can have different input interfaces each associated with different operations. For example, the electronic device can include several buttons associated with different operations. In some embodiments, a portable media device can include distinct buttons for controlling playback operations, for example buttons for each of play/pause, next/fast forward, previous/rewind, volume up, and volume down.

As the size of electronic devices reduces, the input interface can become a limiting factor to the size reduction. For example, the size of an electronic device can be reduced so that the device includes no display and only a single input interface (e.g., a single button). As another example, the size of the electronic device can be reduced such that the device does not include an input interface, but rather is coupled to a remote input interface (e.g., coupled to an in-cable button connected to a port of the device). If the device includes only a single input interface and no display, the device can require an audio-based user interface to allow the user to control device operations.

SUMMARY OF THE INVENTION

This invention is directed to systems and methods for providing an audio user interface in a device having only a single input interface and no display.

To control simple audio playback operations, the electronic device can associate different types of inputs provided by the single input interface with different simple operations. For example, the electronic device can associate different combinations of short and elongated presses of a button with different playback controls (e.g., play/pause, fast forward, and rewind). There may be insufficient suitable combinations of inputs using the single input interface, however, to provide instructions for more complex electronic device operations. In addition, a user may require some information before being able to direct the device to perform an operation using a single input interface (e.g., which of several playlists is the user selecting with an input).

The electronic device can include an audio menu operative to provide information to the user regarding the current audio being played back, and subsequently provide specific options that the user can select. Because the electronic device can have no display, the audio menu can include a succession of audio clips defining the available options for the device. For example, in response to receiving a user instruction to access an audio menu, the electronic device can initially play back an audio clip based on metadata characterizing the currently played back audio (e.g., a track announcement of the title and artist of the currently played back music) and then play back audio clips associated with available menu options. The user can provide a selection instruction during the playback of an audio clip to select the menu option associated with the audio clip.

The audio menu can include any suitable selectable option. For example, the selectable options can include playlists (e.g., playlist names or numbers), audiobook titles, options to toggle (e.g., shuffle and genius options), or any other suitable option for selecting all or a subset of the available audio. In some embodiments, the options can include metadata tag values or categories for selecting audio matching a selected tag or category value (e.g., all audio by a particular artist, or in a particular album). The menu can allow a user to refine an audio request using a multi-dimensional menu, for example by providing successive options and sub-options for selecting several different types of metadata tags to define the audio subset to play back.

The electronic device can receive or generate the audio clips to play back in the audio menu using any suitable approach. In some embodiments, the audio clips can be recorded and received from a host device. Alternatively, the audio clips can be generated using a text-to-speech engine of the device or of a host device. The host device can provide any suitable content to the electronic device, including for example audio clips for the audio menu, audio to play back (e.g., music), firmware or software updates, or any other suitable information. In some embodiments, the electronic device can provide text strings for which audio clips are necessary to the host device so that the host device can generate the audio clips using a text-to-speech engine.

BRIEF DESCRIPTION OF THE DRAWINGS

The above and other features of the present invention, its nature and various advantages will be more apparent upon consideration of the following detailed description, taken in conjunction with the accompanying drawings in which:

FIG. 1 is a schematic view of an electronic device in accordance with one embodiment of the invention;

FIG. 2 is a table depicting electronic device operations associated with different inputs from the input interface in accordance with one embodiment of the invention;

FIG. 3 is a schematic view of an illustrative diagram indicating the audio clips played back in the menu mode in accordance with one embodiment of the invention;

FIG. 4 is a schematic view of an array of menu options for which audio clips can be played back in accordance with one embodiment of the invention;

FIG. 5 is a schematic view of a multi-dimensional array of menu options for which audio clips can be played back in accordance with one embodiment of the invention;

FIG. 6 is a flowchart of an illustrative process for accessing an audio menu in accordance with one embodiment of the invention;

FIG. 7 is a flowchart of an illustrative process for interacting with an audio menu in accordance with one embodiment of the invention; and

FIG. 8 is a flowchart of an illustrative process for interacting with a multi-dimensional menu in accordance with one embodiment of the invention.

DETAILED DESCRIPTION

An electronic device operative to provide an audio menu is provided.

The electronic device may include a processor and a user interface having at least one sensing element, for example a mechanical sensing element (e.g., a button), a resistive sensing element (e.g., a resistive sensor), or a capacitive sensing element (e.g., a capacitive sensor). The input interface can be integrated in the device, or remotely coupled to the device (e.g., via a cable or wirelessly). Using the input interface, a user can provide inputs to control media playback. For example, the user can provide play/pause, next track, fast forward, previous track, and rewind instructions by providing different combinations of inputs using the input interface (e.g., different numbers and durations of button presses of a single button coupled to the electronic device). To control more advanced or elaborate electronic device operations, the electronic device may include an audio menu mode that the user can access by providing a particular input using the input interface (e.g., an elongated button press). In some embodiments, the user can control media playback and access and navigate the audio menu by providing inputs to a single sensing element of the input interface (e.g., inputs to a single mechanical button). In addition, the inputs detected by the single sensing mechanism can be of substantially the same type (e.g., button presses, finger contacts, or finger swipes) and inputs that are not associated with selecting an option displayed on a screen (e.g., selecting an option displayed on a capacitive touch screen).

In response to receiving the input for enabling the audio menu, the electronic device can play back an initial audio clip associated with the audio menu. For example, the initial audio clip can indicate the current audio being played back by the device, or the playlist with which the current audio is associated. As another example, the initial audio clip can include an indication of the current mode of operation of the device. Following the initial indication, the electronic device can play back audio clips associated with options for different operations of the device. As the subsequent audio clips are played back, the user can provide inputs for skipping or selecting the associated options. For example, the user can provide a first input (e.g., a single button press using a predetermined button) to select the option associated with the currently played back audio clip, and a second input (e.g., an elongated button press using the same predetermined button) to skip to the audio clip for the next option in the menu.

In some embodiments, the audio user interface can include a multi-dimensional menu. For example, one or more of the selectable options can be associated with subsequent sub-options. Suitable sub-options can include, for example, options related to particular metadata tags of media such as artist name, song title, album name, genre, or any other suitable metadata tag. Using different inputs provided using the input interface, a user can navigate the audio menu to select options or sub-options, return to a previous menu level, or exit the audio menu mode.

FIG. 1 is a schematic view of an electronic device in accordance with one embodiment of the invention. Electronic device 100 may include processor 102, storage 104, memory 106, input interface 108, and audio output 110. In some embodiments, one or more of electronic device components 100 may be combined or omitted (e.g., combine storage 104 and memory 106). In some embodiments, electronic device 100 may include other components not combined or included in those shown in FIG. 1 (e.g., a power supply or a bus), or several instances of the components shown in FIG. 1. For the sake of simplicity, only one of each of the components is shown in FIG. 1.

Processor 102 may include any processing circuitry operative to control the operations and performance of electronic device 100. For example, processor 100 may be used to run operating system applications, firmware applications, media playback applications, media editing applications, or any other application. In some embodiments, a processor may drive a display and process inputs received from a user interface.

Storage 104 may include, for example, one or more storage mediums including a hard-drive, solid state drive, flash memory, permanent memory such as ROM, any other suitable type of storage component, or any combination thereof. Storage 104 may store, for example, media data (e.g., music and video files), application data (e.g., for implementing functions on device 100), firmware, user preference information data (e.g., media playback preferences), authentication information (e.g. libraries of data associated with authorized users), lifestyle information data (e.g., food preferences), exercise information data (e.g., information obtained by exercise monitoring equipment), transaction information data (e.g., information such as credit card information), wireless connection information data (e.g., information that may enable electronic device 100 to establish a wireless connection), subscription information data (e.g., information that keeps track of podcasts or television shows or other media a user subscribes to), contact information data (e.g., telephone numbers and email addresses), calendar information data, and any other suitable data or any combination thereof.

Memory 106 can include cache memory, semi-permanent memory such as RAM, and/or one or more different types of memory used for temporarily storing data. In some embodiments, memory 106 can also be used for storing data used to operate electronic device applications, or any other type of data that may be stored in storage 104. In some embodiments, memory 106 and storage 104 may be combined as a single storage medium.

Input interface 108 may include any suitable interface for providing inputs to input/output circuitry of the electronic device. Input interface 108 may include any suitable input interface, such as for example, a button, keypad, dial, a click wheel, a touch pad, or any combination thereof. The input interface can detect user inputs using at least one sensing element, such as a mechanical sensor, resistive sensor, capacitive sensor, a multi-touch capacitive sensor, or any other suitable type of sensing element. In some embodiments, to minimize the overall dimensions of electronic device 100, input interface 108 can include a limited number (e.g., one) of sensing elements operative to detect inputs to the device. Any suitable event sensed by the sensing element can be used to define an input. For example, an input can be detected when a sensor detects an initial event interaction from the user (e.g., the user presses a button). As another example, an input can be detected when a sensor detects the end of an event or interaction from a user (e.g., the user releases a button). As still another example, an input can be detected both when an interaction is initially detected and when the same interaction ends (e.g., a first input when the user presses and holds the button, for example to enter an audio menu mode, and a second input when the user releases the button, for example to select an option from the audio menu).

To further reduce the size of the device, the one or more sensing elements can be remotely coupled to the device, for example wirelessly or via a wire or cable (e.g., a button embedded in a headphone wire). In some embodiments, the input interface can include an assembly having two volume control buttons and a single playback and menu control button placed between the two volume buttons, where the buttons each include a single mechanical sensing element and the assembly is positioned on a headphone wire. In such embodiments, no input interface can be located on the electronic device (e.g., or only a hold switch).

In some embodiments, the input interface can include several sensing elements for controlling media playback, enabling a menu mode, and providing menu navigation and selection instructions. For example, the input interface can include a limited number of sensing elements for controlling the various device operations. In one implementation, the electronic device can include several buttons, each associated with mechanical sensing elements (e.g., dome switches), where different combinations of the several buttons (e.g., two or three buttons) can be used to control media playback, access an audio menu, select audio menu options, and navigate the audio menu. For example, a first button can be used to control media playback, access the audio menu, and select audio menu options, and a second and third button can be used to navigate between options and sub-options within the audio menu.

Audio output 110 may include one or more speakers (e.g., mono or stereo speakers) built into electronic device 100, or an audio connector (e.g., an audio jack or an appropriate Bluetooth connection) operative to be coupled to an audio output mechanism. For example, audio output 110 may be operative to provide audio data using a wired or wireless connection to a headset, headphones or earbuds. In some embodiments, input interface 108 can be incorporated in a portion of audio output 110 (e.g., embedded in the headphone wire).

One or more of input interface 108 and audio output 110 may be coupled to input/output circuitry. The input/output circuitry may be operative to convert (and encode/decode, if necessary) analog signals and other signals into digital data. In some embodiments, the input/output circuitry can also convert digital data into any other type of signal, and vice-versa. For example, the input/output circuitry may receive and convert physical contact inputs (e.g., from a touch pad), physical movements (e.g., from a mouse or sensor), analog audio signals (e.g., from a microphone), or any other input. The digital data can be provided to and received from processor 102, storage 104, memory 106, or any other component of electronic device 100. In some embodiments, several instances of the input/output circuitry can be included in electronic device 100.

In some embodiments, electronic device 100 may include a bus operative to provide a data transfer path for transferring data to, from, or between control processor 102, storage 104, memory 106, input interface 108, sensor 110, and any other component included in the electronic device. Such other components can include, for example, communications circuitry, positioning circuitry, motion detection circuitry, or any other suitable component. In some embodiments, communications circuitry can be used to connect the electronic device to a host device from which media such as audio, metadata related to the audio, and playlists or other information for managing the received audio.

The electronic device can perform any suitable operation in response to detecting particular inputs from the input interface. FIG. 2 is a table depicting electronic device operations associated with different inputs from the input interface in accordance with one embodiment of the invention. Table 200 can include column 210 of inputs and column 220 of associated electronic device operations. Column 210 can include any suitable input, including presses of a button (e.g., as illustrated in FIG. 2). For example, column 210 can include single press 211, extended single press 212, double press 213, double extended press 214, triple press 215, and triple extended press 216. Different electronic device operations can be associated with each of the inputs. For example, column 220 can include play/pause 221, menu 222, next track 223, fast-forward 224, previous track 225, and rewind 226, each associated with the input from the corresponding row in column 210. In some embodiments, other inputs can be associated with particular electronic device operations, such as longer combinations of button presses (e.g., four or more button presses), or combinations of short and long presses (e.g., several long presses, or consecutive short and long presses, such as in Morse code). Other electronic device operations associated with the inputs can include, for example, non-media playback operations such as communications operations (e.g., telephone, text message or e-mail operations), information display operations (e.g., displaying weather, traffic, or mapping information), or operations for accessing a remote database (e.g., web browsing operations or remote access operations).

To access more advanced media operations (e.g., non playback control operations), a user can provide an input associated with a menu command or operation. In the example of FIG. 2, this input can include a single extended input using the input interface (e.g., a single extended press of a button). In response to detecting the menu command input, the electronic device can enable a menu mode of the device. The electronic device can provide any suitable indication of the menu mode. FIG. 3 is a schematic view of an illustrative diagram indicating the audio clips played back in the menu mode in accordance with one embodiment of the invention. Diagram 300 can include a timeline beginning at end 302, and separated into several sections by vertical lines. During section 304, the electronic device can duck the currently played back audio (e.g., played back music). For example, the electronic device can duck the audio by a predetermined amount (e.g., reduce the audio by 50%). As another example, the electronic device can duck the audio to a predetermined value (e.g., reduce the audio to 30% of the audio output). The predetermined value can include a volume setting (e.g., 30% volume), an energy level of the audio (e.g., 30% of the highest energy or audio wave amplitude of the music), an audio strength amount (e.g., a number of dB). Section 304 can have any suitable duration, including for example a duration in the range of 50 ms to 800 ms, such as 300 ms. At the end of section 304, the electronic device can pause the currently played back audio, or continue to playback the audio at the ducked audio level.

Once the played back audio has been ducked and paused, or simply ducked, the electronic device can announce the currently playing back audio during section 310. For example, the electronic device can play back an audio clip associated with the title, artist, album, or any combination thereof, of the currently played back audio. The electronic device can play back audio clips associated with any suitable information for the currently played back audio. For example, the audio clips can be associated with metadata tags of the played back audio. As another example, the audio clips can include a portion of the played back audio (e.g., a sample of the played back audio). The electronic device can play back audio clips for any suitable combination of tags or data associated with the played back audio, including for example the artist name and audio title (e.g., for a song) or the book name and chapter name (e.g., for an audio book). In embodiments where the played back media is ducked and continues to be played back during section 310, the electronic device can set the volume of the track announce audio clip at a level substantially higher than the ducked media.

Once the currently playing back audio has been announced, the electronic device can pause during section 320 to allow the user to exit the menu mode without selecting new audio to playback, or without hearing the menu options. For example, a user can exit the audio menu mode by providing a second selection instruction during section 320, or ending the selection instruction used to initially access the audio menu mode (e.g., release a button press that was held during sections 304 and 310). Section 320 may thus be of particular interest when users wish only to identify the currently playing back song (e.g., and cannot simply view a displayed artist name and title, as there is no display with the device). Section 320 can have any suitable duration, including for example a duration in the range of 100 ms to 1200 ms, such as 500 ms. Once the duration of section 320 lapses, the electronic device can provide an audio indication that the menu options will be provided. For example, during section 322, the electronic device can provide an audio tone or beep to indicate the end of the track announce portion of the audio menu. In some embodiments, the electronic device can instead provide the menu options without first providing an audio indication.

Following the audio tone of section 322, the electronic device can provide, in section 330, audio associated with selectable menu options. The audio clips provided in section 330 can include clips for any suitable option, including for example playlists, audio books, options to toggle for controlling playback (e.g., shuffle on/off or a seed-based playlist on/off options), or any other suitable audio compilation. The particular options for which audio clips are played back will be described in more detail below. In particular, the seed-based playlist option can be related to the Genius playlist option available from iTunes, available from Apple Inc., by which a playlist of media related to a seed can be automatically generated.

The audio clips associated with the audio menu (e.g., with the sections of diagram 300) can be played back in any suitable manner. In some embodiments, the audio clips can be automatically played back sequentially (e.g., auto-played) in response to accessing the audio menu (e.g., in response to a first input to enter the menu mode, such as a single, elongated press of a button, as indicated in table 200, FIG. 2). The user can then provide a subsequent input at any time during the playback of audio clips. In response to a subsequent input prior to or during the audio tone of section 322, the electronic device can exit the audio menu mode, re-increase the ducked audio (and resume playback of the paused media item, if it was paused during section 304 or section 310), and continue to play back the audio played back prior to entering the menu mode. In response to a subsequent input after the audio tone of section 322 (or after the duration of section 322 lapses), the electronic device can identify the playlist or other option associated with the audio clip played back when the subsequent input was received. The electronic device can then play back audio from the identified playlist, or perform the operation associated with the selected option. In addition, the electronic device may associate particular inputs of the input interface with exiting the menu mode (e.g., even after playing the audio tone and beginning to play back playlists), or skipping forward and back between audio options. For example, a single elongated input detected by the sensing element (e.g., a single press of a button) while in the menu mode can be associated with exiting the menu mode, a double input detected by the sensing element (e.g., a double press of a button) can be associated with skipping to the next audio clip, and a triple input detected by the sensing element (e.g., a triple press of a button) can be associated with skipping to the previous audio clip. As another example, inputs detected by different sensing elements such as, for example volume controls (e.g., two buttons associated with volume up and volume down commands) can be associated with navigating menu options while in the menu mode.

In some embodiments, the user can instead hold an input upon entering the menu mode. To exit the menu mode without providing a playlist selection, the user can release the input prior to or during the audio tone of section 322. To provide a selection of a playlist or other option, the user can release the input upon the device playing back the audio clip associated with the playlist or option of interest.

In some embodiments, the audio clips of the audio menu may not automatically be played back sequentially. Instead, the user can control the playback of the audio clips by providing navigation instructions. Such navigation instructions can include, for example, a single elongated press while in the menu mode to exit the menu mode, a double press to skip to the next audio clip, and a triple press to skip to the previous audio clip. As another example, volume controls (e.g., two buttons associated with volume up and volume down commands) can be associated with navigating menu options while in the menu mode.

The electronic device can provide any suitable menu options in the menu mode. FIG. 4 is a schematic view of an array of menu options for which audio clips can be played back in accordance with one embodiment of the invention. Array 400 can include several options of different types. In some embodiments, array 400 can include playlist options 402, 404 and 408. Each playlist can be identified in any suitable manner, including for example by playlist name (e.g., as set on a host device from which the playlists were received), a number, metadata associated with one or more audio items in the playlist, or any other suitable identifying information. In some embodiments, array 400 can also include options for audio books stored on the electronic device (e.g., instead of or in addition to playlist options). The playlists and audio books can be ordered in any suitable manner in array 400. In some embodiments, the order may be based on the order in which the playlists were added to the electronic device from a host device (e.g., the order in which the playlists were dragged into the electronic device using a host device application, such as iTunes available from Apple Inc.). Other suitable orders can include, for example, alphabetical (e.g., based on playlist title or metadata values), numerical (e.g., based on the number of audio items in each playlist, or on a numerical order associated with each playlist), or on any other suitable attribute of the playlists. In some embodiments, the playlist order can change based on which playlist is being played back upon entering the menu mode. For example, the electronic device can re-order the playlists to start with the current playlist (e.g., and continue with the next playlist in the set order, or continue with the first playlist in the set order).

Array 400 can include additional options, such as all songs option 410, and options that can be toggled on or off. For example, array 400 can include shuffle option 420 and genius option 422, which the user can select to turn on or off. In response to receiving a selection of a toggled option, the electronic device can change the value of the option, and either exit the menu mode, or return to the menu mode and continue or restart playing back audio clips associated with the available menu options (e.g., audio clips associated with the options of array 400). In some embodiments, the audio clips associated with toggled options can include an indication of the value of the option (e.g., the audio clip can be “shuffle on” or “shuffle off” based on the current value of the toggled option). In some embodiments, array 400 can include a repeat option and an exit option (not shown) to allow the user to either repeat the available options or to exit the audio menu mode and return to the previously played back audio. In some embodiments, array 400 can include an audio classification parameter or classification value related to the audio available for playback on the device.

In some embodiments, the audio user interface can include a multi-dimensional menu. FIG. 5 is a schematic view of a multi-dimensional array of menu options for which audio clips can be played back in accordance with one embodiment of the invention. Array 500 can include several options of different types, some or all of which can be associated with subsequent sub-options. In some embodiments, array 500 can include track announcement indication 502 for identifying the current audio played back by the device upon entering the menu mode. Track announcement indication 502 and the subsequent selectable options of array 500 can be separated using any suitable approach, including for example using an audio tone or beep (e.g., as discussed above), or without an indication to the user (e.g., options are separated by time lapses and no audio indications). Following indication 502, for example, array 500 can include several selectable options associated with metadata or other tags of audio available for playback. For example, array 500 can include titles option 510, artists option 520, albums option 530, genres option 540, and playlists option 550. Any other suitable tag or information used to classify or identify audio can be used instead or in addition to those shown in array 500. Such tags can include, for example, audiobooks, album artist, audio rating, popularity (e.g., as determined from a remote popularity index), or any other suitable tag.

In response to receiving a user selection of one of options 510, 520, 530, 540 and 550, the electronic device can provide audio clips associated with the sub-options for the particular selected option. For example, in response to receiving a user selection of titles option 510, the electronic device can play back audio clips of song titles 512. As another example, in response to receiving a user selection of artists option 520, the electronic device can play back audio clips of artists 522. As still another example, in response to receiving a user selection of albums option 530, the electronic device can play back audio clips of albums 532. As yet still another example, in response to receiving a user selection of genres option 540, the electronic device can play back audio clips of genres 542. As still another example, in response to receiving a user selection of playlists option 550, the electronic device can play back audio clips of available playlists 552 stored on the device.

If the number of sub-options associated with a selected option is too large (e.g., exceeds a pre-defined maximum value), the electronic device can instead provide an intermediate sub-option further classifying the sub-options. For example, the electronic device can provide audio clips associated with the letters of the alphabet, and subsequently the sub-options beginning with a selected letter. Alternatively, the electronic device can associate particular inputs of the input interface with quick navigation operations to allow for more rapid traverse of the sub-option audio clips (e.g., inputs for skipping forward and back between initial letters, or options for fast forwarding or rewinding the playback of audio clips for the sub-options). As still another example, the electronic device can detect particular inputs for navigating directly to particular sub-options (e.g., detect inputs in Morse code to skip directly to a particular letter or number, for example after enabling Morse-code based inputs).

The electronic device can perform any suitable operation in response to receiving a user selection of a sub-option. In some embodiments, the electronic device can play back audio associated with the selected option (e.g., playback all audio associated with the selected artist, or all of the audio in the selected album or playlist). In some embodiments, the electronic device can provide options for selecting other metadata tags associated with the audio clips related to the selected sub-option. For example, in response to receiving a user selection of an album, the electronic device can provide audio clips associated with playing back the album, the album artist, and titles within the album. As another example, in response to receiving a user selection of an artist, the electronic device can provide audio clips associated with playing back the audio associated with the artist, the albums by the artist, titles of audio by the artist, or playlists having audio by the artist.

In some embodiments, array 500 can include playlist seed option 560. In response to a user selection of playlist seed option 560, the electronic device can play back audio clips associated with information 562 used to identify a playlist seed. For example, the electronic device can play back audio clips for audio titles to serve as seeds directly, or the electronic device can instead or in addition play back audio clips for tags used to classify audio (e.g., artists, titles and genres options). In response to receiving a user selection of a tag sub-option, the electronic device can provide further sub-options for particularly identifying one or more audio items to use as a seed for the seed-based playlist (e.g., sub-options associated with one or more of options 510, 520, 530, 540 and 550).

A user can navigate a multi-dimensional audio menu using any suitable approach. In some embodiments, a user can select an option for which an audio clip is played back. For example, the user can provide an instruction using the input interface (e.g., detected by a sensing element). In response to detecting the instruction, the electronic device can play back audio clips associated with the sub-options of the selected option. The user can then further navigate along the audio menu by continuing to select sub-options to direct the device to play back audio clips associated with further sub-options of the selected sub-option. If an electronic device operation is associated with a selected sub-option (e.g., an instruction to play back audio by a particular user), the electronic device can perform the option in response to receiving an appropriate user selection. The user can in addition navigate up the audio menu (e.g., from a sub-menu to a parent menu) using a different input than the selection input. For example, the user can provide a single input to the sensing element (e.g., a single button press) to provide a selection instruction, and a double input to the sensing element (e.g., a double button press) to navigate up a menu level.

In some embodiments, the audio clips played back by the electronic device can be context-sensitive. For example, the electronic device can identify in the audio clip the current value of a toggled option (e.g., shuffle on or off). As another example, the electronic device can identify the particular selected option to play in an audio clip associated with a play back instruction (e.g., “Play artist Coldplay,” “Play title Viva la Vida,” or “Play audiobook The Hobbit”).

The electronic device can generate the audio clips to play back for the selectable options or sub-options using any suitable approach. In some embodiments, the electronic device can include a text-to-speech engine and sufficient processing to generate the audio clips on the device. Alternatively, the electronic device can receive the audio clips associated with the selectable options from a host device. The host device can generate the audio clips using any suitable approach, including for example using a text-to-speech engine. In some embodiments, the host device can include more substantive processing capabilities than the electronic device, which can allow the host device to generate more polished or accurate audio clips (e.g., audio clips that account for accents or languages). Using a text-to-speech engine can allow the electronic device to generate audio clips for all of the audio menu options (e.g., as opposed to identifying recorded audio clips for menu options, which can not all be available (e.g., no audio clips can be available for less common titles or artist names).

In host device text-to-speech engine embodiments, the host device can identify text strings from which to generate audio clips using any suitable approach. In some embodiments, the host device can identify text strings associated with the electronic device firmware or operating system from the electronic device manufacturer (e.g., as part of a firmware download from the manufacturer when the electronic device firmware is updated). In some embodiments, the electronic device can instead or in addition provide text strings to the host device for synthesizing. For example, the electronic device can provide text strings to the host device as part of a synching protocol when the electronic device is coupled to the host device. In some embodiments, the electronic device can identify the text strings based on the data provided to the electronic device by the host device. For example, the host device can use metadata tags associated with the audio to be transferred to the electronic device as text strings for which to provide audio clips.

The following flowcharts describe illustrative processes used in connection with the audio user interface of one embodiment of the invention. FIG. 6 is a flowchart of an illustrative process for accessing an audio menu in accordance with one embodiment of the invention. Process 600 can begin at step 602. At step 604, the electronic device can determine whether an input from an input interface was detected. For example, the electronic device can determine whether an input was received from a single sensing element of an input interface coupled to the device. If the electronic device determines that no input was received, process 600 can return to step 604 and continue to monitor for electronic device inputs. If, at step 604, the electronic device instead determines that an input was received, process 600 can move to step 606.

At step 606, the electronic device can determine whether the detected input is associated with accessing an audio menu. For example, the electronic device can determine whether the detected input matches the input associated with accessing an audio menu (e.g., a single elongated press of a button). If the electronic device determines that the detected input is not associated with accessing an audio menu, process 600 can move to step 608. At step 608, the electronic device can perform a playback operation associated with the detected input. For example, the electronic device can determine that all non-audio menu inputs are associated with playback control, and direct the electronic device to perform the playback operation associated with the detected input. Process 600 can then end at step 610.

If, at step 606, the electronic device instead determines that the detected input is associated with providing an audio menu, process 600 can move to step 612. At step 612, the electronic device can provide an audio menu. For example, the electronic device can play back audio clips for several selectable options. Process 600 can then end at step 614.

FIG. 7 is a flowchart of an illustrative process for interacting with an audio menu in accordance with one embodiment of the invention. Process 700 can begin at step 702. At step 704, the electronic device can duck currently played back audio. For example, the electronic device can decrease the volume of the audio to a particular level. The particular level can be determined based on the volume value, a measurement of the music energy, the decibel output of the audio, or any other suitable measure. At step 706, the electronic device can determine whether an input was received during ducking. For example, the electronic device can determine whether an input was received from the input interface (e.g., an input detected from a single sensing element) used to access or enable the audio menu mode. If the electronic device determines an input was received, process 700 can move to step 708.

At step 708, the electronic device can exit the audio menu. For example, the electronic device can cease the audio menu playback (e.g., cease ducking, track announcement, or playing back menu options) and return to the playback of the previous audio. In some embodiments, the electronic device can increase the electronic device volume to the volume level before enabling the audio menu. Process 700 can then end at step 710.

If, at step 706, the electronic device instead determines that no input was received during ducking, process 700 can move to step 712. At step 712, the electronic device can announce the played back track prior to enabling the audio menu. For example, the electronic device can identify an audio clip associated with metadata of the played back audio (e.g., an audio clip for the audio title and artist). At step 714, the electronic device can determine whether an input was received during the track announcement. For example, the electronic device can determine whether an input was received from the input interface used to access or enable the audio menu mode. If the electronic device determines an input was received, process 700 can move to step 708, described above. If, at step 714 the electronic device instead determines that no input was received during the track announcement, process 700 can move to step 716.

At step 716, the electronic device can pause (e.g., leave a moment of silence to allow the user to register the track announcement and decide whether or not to exit the audio menu mode) and provide an audio tone to indicate that menu options will be provided. For example, the electronic device can pause for a predetermined duration (e.g., 500 ms) and provide an audio tone following the pause. At step 718, the electronic device can determine whether an input was received during the pause and the provided audio tone. For example, the electronic device can determine whether an input was received from the input interface used to access or enable the audio menu mode. If the electronic device determines an input was received, process 700 can move to step 708, described above. If, at step 718 the electronic device instead determines that no input was received during the pause and audio tone, process 700 can move to step 720.

At step 720, the electronic device can play back menu options associated with an audio menu. For example, the electronic device can play back consecutive audio clips associated with menu options playback. At step 722, the electronic device can determine whether an input was received during the menu option. For example, the electronic device can determine whether an input to select an audio mode option was received from the input interface. If the electronic device determines that no input was received, process 700 can return to step 720 and continue to play back audio clips for audio menu options. If, at step 722, the electronic device instead determines that an input was received, process 700 can move to step 724. At step 724, the electronic device can perform an operation associated with the menu option selected by the detected user input. For example, the electronic device can play back audio associated with a selected option. Process 700 can then end at step 710.

FIG. 8 is a flowchart of an illustrative process for interacting with a multi-dimensional menu in accordance with one embodiment of the invention. Process 800 can begin at step 802. At step 804, the electronic device can play back menu options associated with an audio menu. For example, the electronic device can play back consecutive audio clips associated with menu options. At step 806, the electronic device can determine whether an input was received during the playback of an audio clip associated with a menu option. For example, the electronic device can determine whether an input was received to select one of the options for which an audio clip was played back. If the electronic device determines that no input was received, process 800 can return to step 804 and continue to play back audio clips for audio menu options. If, at step 806, the electronic device instead determines that an input was received, process 800 can move to step 808. At step 808, the electronic device can play back audio clips for sub-options associated with the selected option. For example, the electronic device can play back audio clips for artist names in response to receiving an input selecting an artist option.

At step 810, the electronic device can determine whether an input was received during the playback of an audio clip associated with a sub-option. For example, the electronic device can determine whether an input was received to select one of the sub-options for which an audio clip was played back. If the electronic device determines that no input was received, process 800 can return to step 808 and continue to play back audio clips for audio menu sub-options. If, at step 810, the electronic device instead determines that an input was received, process 800 can move to step 812.

At step 812, the electronic device can determine whether the input detected at step 810 was an input associated with an instruction to go up a menu level. For example, the electronic device can determine whether the input was associated with a “back” instruction (e.g., an elongated press of a button). If the electronic device determines that the detected input is associated with an instruction to go up a menu level, process 800 can move to back to step 804 and play back audio clips for menu options one level up from the sub-option level of step 808. If, at step 812, the electronic device instead determines that the input detected was not an input associated with an instruction to go up a menu level, process 800 can move to step 814. At step 814, the electronic device can determine whether the input detected at step 810 was an input for a sub-option associated with an electronic device operation. For example, the electronic device can determine whether the input was associated with a selection instruction (e.g., a single press of a button) for a sub-option option associated with an electronic device operation. If the electronic device determines that the detected input is not associated an electronic device operation (e.g., the sub-option selected is not associated with an electronic device operation), process 800 can move to step 816. At step 816, the electronic device can play back audio clips for subsequent sub-options associated with the selected sub-option. For example, the electronic device can identify subsequent sub-options for the selected sub-option, and play back audio clips associated with the identified subsequent sub-options. Process 800 can then return to step 810 and monitor for inputs during the playback of the audio clips.

If at step 814, the electronic device instead determines that the input detected was an input for a sub-option associated with an electronic device operation (e.g., the selected sub-option is associated with an electronic device operation), process 800 can move to step 818. At step 818, the electronic device can perform an electronic device operation associated with the selected sub-option. For example, the electronic device can play back audio identified by the sub-option (e.g., audio by the selected artist name sub-option). Process 800 can then end at step 820.

The above-described embodiments of the present invention are presented for purposes of illustration and not of limitation, and the present invention is limited only by the claims which follow. 

What is claimed is:
 1. A method for controlling an electronic device using a single sensing element, comprising: playing back with the electronic device a first audio clip in a first manner; during the playing back in the first manner: detecting with the single sensing element a first user input; in response to the detecting the first user input, altering with the electronic device the playing back from the first manner to a second manner; and in response to the altering, playing back with the electronic device the first audio clip in the second manner; during the playing back the first audio clip in the second manner, playing back with the electronic device a second audio clip to announce the first audio clip; and after the playing back the second audio clip, playing back with the electronic device a first menu audio clip that is associated with a menu of the electronic device.
 2. The method of claim 1, wherein a first menu option is associated with each of the first menu audio clip that is played back and an electronic device operation, the method further comprising: detecting with the single sensing element a second user input; and in response to the detecting the second user input, accessing with the electronic device the first menu option.
 3. The method of claim 1, wherein the second audio clip that is played back corresponds to at least one of a title, an artist, an album name, an audiobook title, a chapter name, a genre, and a year that is associated with the first audio clip that is played back.
 4. The method of claim 1, further comprising: after the detecting the first user input, entering with the electronic device a menu mode of the electronic device; and at least one of during the playing back the first audio clip in the second manner and after the playing back the first audio clip in the second manner: detecting with the single sensing element a second user input; and exiting with the electronic device the entered menu mode in response to the detecting the second user input.
 5. The method of claim 4, wherein the detected first user input comprises a single press and hold of the single sensing element, and wherein the detected second user input comprises a release of the pressed and held single sensing element.
 6. The method of claim 4, wherein each of the detected first user input and the detected second user input comprises a single elongated press of the single sensing element.
 7. The method of claim 1, further comprising: detecting with the single sensing element a second user input during the playing back the first menu audio clip; and in response to the detecting the second user input, skipping the playing back the first menu audio clip to play back a second menu audio clip.
 8. The method of claim 1, wherein the single sensing element comprises at least one of a mechanical sensor, a resistive sensor, and a capacitive sensor.
 9. The method of claim 1, wherein the altering the playing back to the second manner comprises ducking the first audio clip that is played back in the first manner.
 10. The method of claim 9, wherein the ducking comprises reducing a volume of the playing back the first audio clip in the first manner.
 11. The method of claim 10, wherein the reducing the volume comprises adjusting the volume to a predefined volume level that is audible.
 12. The method of claim 1, wherein the playing back the first audio clip in the second manner comprises playing back the first audio clip at a first volume level, and wherein the playing back the second audio clip comprises playing back the second audio clip at a second volume level that is higher than the first volume level.
 13. The method of claim 1, wherein the detected first user input comprises a single extended press of the single sensing element.
 14. The method of claim 1, wherein the second audio clip that is played back corresponds to metadata that is associated with the first audio clip that is played back.
 15. The method of claim 1 further comprising, after the playing back the second audio clip, but before the playing back the first menu audio clip: playing back a third audio clip to indicate an end of the announcing of the first audio clip.
 16. The method of claim 15, wherein the third audio clip that is played back comprises at least one of a tone and a beep.
 17. The method of claim 1, wherein the first menu audio clip that is played back is associated with a first menu option of the menu.
 18. The method of claim 17 further comprising, during the playing back the first menu audio clip: detecting with the single sensing element a second user input; and in response to the detecting the second user input, accessing with the electronic device the first menu option.
 19. The method of claim 1, wherein, after the detecting the first user input, but before the playing back the first menu audio clip: accessing with the electronic device a menu mode of the electronic device; and after the accessing, identifying with the electronic device at least the first menu audio clip.
 20. An electronic device for providing an audio menu to a user, comprising: a single sensing element for detecting user inputs; an audio output; and a processor operative to: direct the audio output to play back a first audio clip in a first manner; during the playback of the first audio clip in the first manner, receive from the single sensing element a first user input that is detected by the single sensing element; in response to the receiving the first user input, direct the audio output to alter the playback from the first manner to a second manner; during the playback of the first audio clip in the second manner, direct the audio output to play back a second audio clip to announce the first audio clip; and after the playback of the second audio clip, direct the audio output to play back a first menu audio clip that is associated with the audio menu.
 21. The electronic device of claim 20, wherein the processor is further operative to direct the audio output to alter the playback to the second manner by directing the audio output to duck the first audio clip that is being played back in the first manner.
 22. The electronic device of claim 21, wherein the audio output ducks the first audio clip by reducing a volume of the playback of the first audio clip in the first manner.
 23. The electronic device of claim 22, wherein the reducing the volume comprises reducing the volume to a predefined volume level that is audible.
 24. The electronic device of claim 22, wherein the volume is reduced over a duration in the range of 50 ms to 800 ms.
 25. The electronic device of claim 20, wherein: the single sensing element comprises a mechanical sensor; the detected first user input comprises an elongated press of the mechanical sensor; and the detected second user input comprises a subsequent press of the mechanical sensor.
 26. The electronic device of claim 20, wherein: the single sensing element comprises a mechanical sensor; the detected first user input comprises a press and hold of the mechanical sensor; and the detected second user input comprises a release of the mechanical sensor.
 27. The electronic device of claim 20, wherein the processor is further operative to: during the playback of the first menu audio clip, receive a second user input from the single sensing element that is detected by the single sensing element; and after the receiving the detected second user input, direct the audio output to play back a second menu audio clip that is associated with the audio menu.
 28. The electronic device of claim 20, wherein the audio output is coupled to at least one of an earbud, headphones, and a speaker.
 29. The electronic device of claim 20, wherein the audio output is operative to play back the first audio clip in the second manner by playing back the first audio clip at a first volume level, and wherein the audio output is operative to play back the second audio clip by playing back the second audio clip at a second volume level that is higher than the first volume level.
 30. The electronic device of claim 20, wherein the second audio clip that is played back corresponds to metadata that is associated with the first audio clip.
 31. The electronic device of claim 20, wherein the processor is further operative to, after the directing the audio output to play back the second audio clip, but before the directing the audio output to play back the first menu audio clip: direct the audio output to play back a third audio clip to indicate an end of the announcing of the first audio clip.
 32. The electronic device of claim 31, wherein the third audio clip that is played back comprises at least one of a tone and a beep.
 33. The electronic device of claim 20, wherein the first menu audio clip that is played back is associated with a first menu option of the menu.
 34. The electronic device of claim 33, wherein the processor is further operative to, while the audio output is playing back the first menu audio clip: receive from the single sensing element a second user input that is detected by the single sensing element; and in response to the receiving the second user input, access the first menu option.
 35. The electronic device of claim 20, wherein the processor is further operative to, after the receiving the first user input, but before the directing the audio output to play back the first menu audio clip: access the audio menu of the electronic device; and after the accessing, identify at least the first menu audio clip.
 36. The electronic device of claim 20, wherein the electronic device does not comprise a display.
 37. The electronic device of claim 20, wherein the single sensing element is the only user input interface of the electronic device.
 38. A non-transitory computer-readable media for controlling an electronic device using a single input interface, the non-transitory computer-readable media comprising computer-readable instructions for: directing an audio output of the electronic device to play back a first audio clip in a first manner; during the playback of the first audio clip in the first manner, receiving a first user input from the single input interface; in response to the receiving, directing the audio output to alter the playback from the first manner to a second manner; during the playback of the first audio clip in the second manner, directing the audio output to play back a second audio clip to announce the first audio clip; and after the playback of the second audio clip, directing the audio output to play back a first menu audio clip that is associated with an audio menu of the electronic device. 